Loadstar: A Load Shedding Scheme for Classifying Data Streams

نویسندگان

  • Yun Chi
  • Philip S. Yu
  • Haixun Wang
  • Richard R. Muntz
چکیده

We consider the problem of resource allocation in mining multiple data streams. Due to the large volume and the high speed of streaming data, mining algorithms must cope with the effects of system overload. How to realize maximum mining benefits under resource constraints becomes a challenging task. In this paper, we propose a load shedding scheme for classifying multiple data streams. We focus on the following problems: i) how to classify data that are dropped by the load shedding scheme? and ii) how to decide when to drop data from a stream? We introduce a quality of decision (QoD) metric to measure the level of uncertainty in classification when exact feature values of the data are not available because of load shedding. A Markov model is used to predict the distribution of feature values and we make classification decisions using the predicted values and the QoD metric. Thus, resources are allocated among multiple data streams to maximize the quality of classification decisions. Furthermore, our load shedding scheme is able to learn and adapt to changing data characteristics in the data streams. Experiments on both synthetic data and real-life data show that our load shedding scheme is effective in improving the overall accuracy of classification under resource constraints. keywords: data mining, data streams, load shedding, classification, quality of decision, feature prediction, Markov model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Loadstar: Load Shedding in Data Stream Mining

In this demo, we show that intelligent load shedding is essential in achieving optimum results in mining data streams under various resource constraints. The Loadstar system introduces load shedding techniques to classifying multiple data streams of large volume and high speed. Loadstar uses a novel metric known as the quality of decision (QoD) to measure the level of uncertainty in classificat...

متن کامل

A New Adaptive Load-Shedding and Restoration Strategy for Autonomous Operation of Microgrids: A Real-Time Study

Islanding operation is one of the main features of a MicroGrid (MG), which is realized regarding the presence of distributed energy resources (DERs). However, in order to deal with the control challenges, which an MG faces during island operation, particularly when the transition is associated with certain excessive load, an efficient control strategy is required. This paper introduces a Centra...

متن کامل

ارائه الگوریتم حذف بار وفقی جهت حفاظت سیستم قدرت در مقابل حوادث ترکیبی منجر به خاموشی سراسری

In recent years several catastrophic power systems blackouts have occurred worldwide. Various reasons have been declared for these failures. Economical limitations due to power system restructuring restrictions, inadvertent operation of protective relays and inefficient design of conventional load shedding schemes are of the most important reasons causing these blackouts. In fact, due to both e...

متن کامل

Supervisory Control of a Hybrid AC/DC Micro-Grid with Load Shedding Based on the Bankruptcy Problem

In this paper, a supervisory controller is proposed to manage the power flow in a hybrid AC/DC micro-grid for both grid-connected and disconnected modes. When the hybrid AC/DC micro-grid is connected to the utility grid, power surplus or shortage leads to power trade between the micro-grid and the utility grid. In the grid-disconnected mode, the renewable power sources (wind and solar generatio...

متن کامل

Load Shedding in Network Monitoring Applications

Monitoring and mining real-time network data streams are crucial operations for managing and operating data networks. The information that network operators desire to extract from the network traffic is of different size, granularity and accuracy depending on the measurement task (e.g., relevant data for capacity planning and intrusion detection are very different). To satisfy these different d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005